Session 10: Corpora and Evaluation
نویسنده
چکیده
This session on corpora and evaluation was composed of two distinct parts. Before the break, four papers dealing with a range of important aspects of evaluation of written language systems and spoken language systems were presented. A printed version of each of these papers is included in the conference proceedings. After the break, a series of informal reports (not included as proceedings papers) were given summarizing the work of the Corpora and Performance Evaluation Committee (CPEC) of the DARPA Spoken Language Systems (SLS) Program, with specific reports from several working groups which have been dealing with various aspects of corpora collection and performance evaluation in the SLS Program. A lively and extended discussion followed these working group reports, including presentations of a number of alternate viewpoints.
منابع مشابه
Data Collection And Evaluation
This session focussed on two inter-related issues: (I) performance assessment for spoken language systems and (2) experience to date in speech corpora collection for these systems. The session included formal presentations from representatives of SRI International, MIT's Laboratory for Computer Science, BBN Systems and Technologies Corporation, and Carnegie Mellon University's School of Compute...
متن کاملCLEF 2017 Dynamic Search Lab Overview And Evaluation
In this paper we provide an overview of the first edition of the CLEF Dynamic Search Lab. The CLEF Dynamic Search lab ran in the form of a workshop with the goal of approaching one key question: how can we evaluate dynamic search algorithms? Unlike static search algorithms, which essentially consider user request’s independently, and which do not adapt the ranking w.r.t the user’s sequence of i...
متن کاملCWI at TREC 2012, KBA Track and Session Track
We participated in two tracks: Knowledge Base Acceleration (KBA) Track and Session Track. In the KBA track, we focused on experimenting with different approaches as it is the first time the track is launched. We experimented with supervised and unsupervised retrieval models. Our supervised approach models include language models and a string-learning system. Our unsupervised approaches include ...
متن کاملSession 1: Lexicons, Corpora, and Evaluation
Our technologies for collecting, storing, and disseminating vast amounts of information have gotten ahead of our technologies for collating and analyzing it, and that situation has posed a serious challenge for human language technology. As a consequence, natural language processing has been moving rapidly towards large-scale systems addressed to real tasks. Demos that won't scale up are no lon...
متن کاملاستخراج پیکره موازی از اسناد قابلمقایسه برای بهبود کیفیت ترجمه در سیستمهای ترجمه ماشینی
Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1991